NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

An ORSAC method for data cleaning inspired by RANSAC

https://doi.org/10.11591/ijict.v13i3.pp484-498

Jenkins, Thomas; Goodwin, Autumn; Talafha, Sameerah (December 2024, International Journal of Informatics and Communication Technology (IJ-ICT))

In classification problems, mislabeled data can have a dramatic effect on the capability of a trained model. The traditional method of dealing with mislabeled data is through expert review. However, this is not always ideal, due to the large volume of data in many classification datasets, such as image datasets supporting deep learning models, and the limited availability of human experts for reviewing the data. Herein, we propose an ordered sample consensus (ORSAC) method to support data cleaning by flagging mislabeled data. This method is inspired by the random sample consensus (RANSAC) method for outlier detection. In short, the method involves iteratively training and testing a model on different splits of the dataset, recording misclassifications, and flagging data that is frequently misclassified as probably mislabeled. We evaluate the method by purposefully mislabeling subsets of data and assessing the method’s capability to find such data. We demonstrate with three datasets, a mosquito image dataset, CIFAR-10, and CIFAR-100, that this method is reliable in finding mislabeled data with a high degree of accuracy. Our experimental results indicate a high proficiency of our methodology in identifying mislabeled data across these diverse datasets, with performance assessed using different mislabeling frequencies.
more » « less
Full Text Available
Mosquito species identification accuracy of early deployed algorithms in IDX, A vector identification tool

https://doi.org/10.1016/j.actatropica.2024.107392

Gupta, Khushi Anil; Ikonomidou, Vasiliki N; Glancey, Margaret; Faiman, Roy; Talafha, Sameerah; Ford, Tristan; Jenkins, Thomas; Goodwin, Autumn (December 2024, Acta Tropica)

Mosquito-borne diseases continue to pose a great threat to global public health systems due to increased insecticide resistance and climate change. Accurate vector identification is crucial for effective control, yet it presents significant challenges. IDX - an automated computer vision-based device capable of capturing mosquito images and outputting mosquito species ID has been deployed globally resulting in algorithms currently capable of identifying 53 mosquito species. In this study, we evaluate deployed performance of the IDX mosquito species identification algorithms using data from partners in the Southeastern United States (SE US) and Papua New Guinea (PNG) in 2023 and 2024. This preliminary assessment indicates continued improvement of the IDX mosquito species identification algorithms over the study period for individual species as well as average regional accuracy with macro average recall improving from 55.3 % [Confidence Interval (CI) 48.9, 61.7] to 80.2 % [CI 77.3, 84.9] for SE US, and 84.1 % [CI 75.1, 93.1] to 93.6 % [CI 91.6, 95.6] for PNG using a CI of 90 %. This study underscores the importance of algorithm refinement and dataset expansion covering more species and regions to enhance identification systems thereby reducing the workload for human experts, addressing taxonomic expertise gaps, and improving vector control efforts.
more » « less
Full Text Available
Accelerated steady-state electrostatic particle-in-cell simulation of Langmuir probes

https://doi.org/10.1063/5.0072994

Werner, Gregory R.; Robertson, Scott; Jenkins, Thomas G.; Chap, Andrew M.; Cary, John R. (January 2022, Physics of Plasmas)

Full Text Available
Dispersion and the speed-limited particle-in-cell algorithm

https://doi.org/10.1063/5.0046935

Jenkins, Thomas G.; Werner, Gregory R.; Cary, John R. (June 2021, Physics of Plasmas)
null (Ed.)
Full Text Available
Computing the Paschen curve for argon with speed-limited particle-in-cell simulation

https://doi.org/10.1063/5.0051095

Theis, Joseph G.; Werner, Gregory R.; Jenkins, Thomas G.; Cary, John R. (June 2021, Physics of Plasmas)
null (Ed.)
Full Text Available
Speeding up simulations by slowing down particles: Speed-limited particle-in-cell simulation

https://doi.org/10.1063/1.5061683

Werner, Gregory R.; Jenkins, Thomas G.; Chap, Andrew M.; Cary, John R. (December 2018, Physics of Plasmas)

Full Text Available
TRY plant trait database – enhanced coverage and open access

https://doi.org/10.1111/gcb.14904

Kattge, Jens; Bönisch, Gerhard; Díaz, Sandra; Lavorel, Sandra; Prentice, Iain Colin; Leadley, Paul; Tautenhahn, Susanne; Werner, Gijsbert D.; Aakala, Tuomas; Abedi, Mehdi; et al (December 2019, Global Change Biology)

Full Text Available

Search for: All records